Windows legacy audio components

This article describes audio APIs and components in Microsoft Windows which are now obsolete or deprecated.

Contents

Multimedia Extensions (MME)

The MME API or the Windows Multimedia API (also known as WinMM) was the first universal and standardized Windows audio API. Wave sound events played in Windows (up to Windows XP) and MIDI I/O use MME. The devices listed in the Multimedia/Sounds and Audio control panel applet represent the MME API of the sound card driver.

The Multimedia Extensions (WaveIn/WaveOut interfaces) were released in autumn 1991 to support sound cards, as well as CD-ROM drives, which were then becoming increasingly available. The Multimedia Extensions were released to Original Equipment Manufacturers (OEMs), mainly CD-ROM drive and sound card manufacturers, and added basic multimedia support for audio input and output and a CD audio player application to Windows 3.0. The Multimedia Extensions' new features were not available in Windows 3.0 real mode, only in 386 enhanced mode. Windows 3.1x would later incorporate many of its features. Microsoft developed the Windows Sound System sound card specification to complement these extensions.

MME lacks mixing multiple audio streams during playback and device sharing, so only one audio stream can be rendered at a time. But some sound card drivers can emulate more than one MME device so it could work with MME too. MME supports up to two channels of recording, 16-bit audio bit depth and sampling rates of up to 44.1 kHz with all the audio being mixed and sampled to 44.1 kHz.

Prior to Windows XP, the number of MME/WinMM device interfaces (waveIn, waveOut, midiIn, midiOut, mixer, and aux) is restricted to 10. This limit is raised from 10 to 32 in Windows XP.[1][2]

Audio Compression Manager

Audio Compression Manager (ACM) is a Windows multimedia framework that manages audio codecs (compressor/decompressors). ACM can also be considered an API specification. A codec must conform to the implicit ACM specification to work with Windows Multimedia. ACM files can be recognized by their filename extension .acm . ACM files also use RIFF-compatible filetypes such as WAV or AVI as a "wrapper" to store audio data encoded by any audio codec supported by ACM.

ACM is considered an outdated framework/API and Microsoft now encourages the use of at least DirectShow. However, unlike ACM and the related Video Compression Manager (VCM), DirectShow provides no means to encode files for end-users but requires developers to build end-to-end graphs for encoding content. ACM also does not support VBR audio streams; therefore newer codecs like MPEG-4 AAC, Ogg Vorbis, FLAC etc. cannot be supported through ACM if using variable bitrates. Though many sources state the contrary, Ogg Vorbis does work well with the ACM, e.g. when embedded in a RIFF-compatible file (such as a WAV or AVI file as mentioned earlier), provided the Ogg Vorbis stream is encoded at a constant bitrate.

DirectX Audio Libraries

KMixer

KMixer is the Kernel Audio Mixer driver, a part of WDM Audio in Windows 98 to Windows XP which handles the mixing of multiple sound buffers into an output.

The tasks performed by KMixer.sys:

In Windows 98, Windows 2000 and Windows Me, the KMixer maximum sample rate is 100 kHz. In Windows XP SP1 and later, KMixer audio sampling rate supports a maximum of 200 kHz .[3][4][5]

Issues

The KMixer was designed to aid the applications by relieving them from the need to perform the mixing of audio streams, especially on low-end sound cards that didn't support multiple sound streams. However, it introduced some significant problems.

First, the latency of KMixer is around 30 ms [6] and it cannot be reduced, because this component sits just right above the port class audio driver, so every audio stream, including those issued by DirectSound (except in cases of hardware mixing) and WinMM, come through the kernel mixer. [7] If the audio hardware supports hardware mixing (also known as hardware buffering or DirectSound hardware acceleration), DirectSound buffers directly to the rendering device. [8] Thus, if DirectSound streams use hardware mixing, KMixer is bypassed. [9]

In earlier releases like the original release of Windows 98, KMixer tried to mix every data format that passed through it, even those it did not support. It caused various problems with media players that tried to pass AC3-encoded surround sound streams through S/PDIF output of the sound card to an external home cinema receiver. This was corrected with Windows Me and provided as a hotfix for Windows 98 Second Edition and Windows 2000 SP2. [10] Starting with Windows Me, the waveOut, DirectSound, and DirectShow APIs support non-PCM formats such as AC-3 or WMA over S/PDIF and non-PCM data goes directly to the class driver instead of going through KMixer.

A new kernel-mode API, Direct Kernel Streaming, was also introduced in Windows 98 in order to bypass the KMixer and avoid problems associated with it.

It should be noted, however, that contrary to popular belief KMixer doesn't alter the sound in the majority of cases.[3] Also, there are many ways to bypass KMixer without the need of an extra plugin to access DirectSound, ASIO, Direct Kernel Streaming or WASAPI. In Windows XP, for example, the usage of DirectSound (which Winamp uses by default) with a hardware mixer is a way to bypass KMixer.[11]

KMixer was removed in Windows Vista. It is replaced by the user-mode WASAPI (Windows Audio Session API) Audio Engine which is part of the revamped audio architecture. The Audio engine can operate in Shared mode or Exclusive mode. In shared mode, mixing still takes place. Pre-mixed PCM audio is sent to the driver in a single format (in terms of sample rate, bit depth and channel count) that is configurable from the Sounds control panel. WASAPI Exclusive mode bypasses the mixer, as does using audio APIs like OpenAL or ASIO, which still have direct access to the hardware.[12]

Kernel Streaming

Kernel Streaming or Direct Kernel streaming (Direct KS) is a technique that supports kernel-mode processing of streamed data. It enables efficient real-time streaming for multimedia devices such as sound cards and TV tuner cards. Kernel streaming allows a device driver to create DirectShow-like filters and pins in kernel mode, providing access to hardware, lower latency communication and still be used within a DirectShow filter graph.

Kernel streaming was introduced in Windows 98. When the sound card uses a custom driver for use with the system supplied port class driver PortCls.sys or implements a mini-driver for use with the streaming class driver, applications can bypass the KMixer completely and use the kernel streaming interfaces instead to reduce latency. Windows 98 includes the first kernel streaming driver, Stream.sys. In Windows XP, Microsoft introduced another improved kernel streaming class driver, AVStream.

Music players such as JRiver Media Center, foobar2000 and Winamp support kernel streaming. Compared to the regular "WaveOut method" in Microsoft Windows, kernel streaming requires less CPU time. This comes at the expense of bypassing the KMixer and Windows volume control. Kernel streaming also does not allow device sharing.

See also

References

External links